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Abstract 

We consider a tree that grows randomly in time. Each time a new vertex appears, 
it chooses exactly one of the existing vertices and attaches to it. The probability that 
the new vertex chooses vertex x is proportional to ty(deg(x)), a weight function of the 
actual degree of x. The weight function w : N —*■ R+ is the parameter of the model. 

In ^ and in the authors derive the asymptotic degree distribution for a model that 
is equivalent to the special case, when the weight function is linear. The proof therein 
strongly relies on the linear choice of w. 

Using well established results from the theory of general branching processes we give 
the asymptotical degree distribution for a wide range of weight functions. Moreover, we 
provide the asymptotic distribution of the tree itself as seen from a randomly selected 
vertex. The latter approach gives greater insight to the limiting structure of the tree. 

Our proof is robust and we believe that the method may be used to answer several 
other questions related to the model. It relies on the fact that considering the evolution 
of the random tree in continuous time, the process may be viewed as a general branching 
process, this way classical results can be applied. 


1 Introduction 

In models of randomly growing networks, the concept of preferential attachment means 
that when a new vertex appears, it is more likely that it attaches to an already existing 
vertex if the latter already has more neighbors. One of the realizations of this idea is the 
Barabasi - Albert graph. In this model, a tree grows in discrete time-steps: at each step 
we add a new vertex and connect it to an already existing vertex. This choice is made 
randomly, using probabilities exactly proportional to the degree of the existing vertices. 
This model reproduces phenomena observed in certain real-world networks (see [2]), the 
power-law decay of the degree sequence, for example. This was proved in a mathematically 
precise way in and, independently, in HH. 

The probability with which a newly appearing vertex chooses its neighbor could depend 
on the degrees of the existing vertices in a more general way. We define a weight function, 
u; : N —> R+, and let the probability be proportional to this function of the degree. In ^ 
and HU, w is linear, their techniques strongly depend on martingales that are apparent in 
the system only in this special case. 
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General weight functions are considered in the model of Krapivsky and Redner j^]. 
There w{k) ~ , and non-rigorous results are obtained, showing the different behavior for 

7 > 1 and 7 < 1. In the first region the limiting object does not have a non-trivial degree 
sequence: a single dominant vertex appears which is linked to almost every other vertex, 
the others having only finite degree. This statement is made precise and proved rigorously 
in (See also [21 for a related model.) The weight functions we consider will be such 
that this does not happen. 

Our main results are the following. We determine the asymptotic distribution of the 
degree sequence, which equivalently gives the limiting distribution of the degree of a (uni¬ 
formly) randomly selected vertex. We also look deeper into the structure of the tree: we give 
the asymptotic distribution of the subtree under a randomly selected vertex. Moreover, we 
present the asymptotic distribution of the whole tree, seen from a randomly selected vertex. 
For a general approach for asymptotic distribution of random subtrees of random trees, see 

These results give greater insight to the limiting structure of the random tree. 

The key of our method is to place the process into continuous time in such a way that 
it hts in to the well-developed theory of general branching processes (see Section El). The 
asymptotic behavior of the continuous time model gives that of the discrete time model, as 
pointed out in Section El A special case of our model (when w is linear) is equivalent to 
those studied in [^ and HU. The robustness of our method is apparent in the fact that it 
gives a.s. results for a wide class of weight functions, it generalizes the results mentioned 
above. 

The class of weight functions for which our theorems hold are not fully identified by the 
condition we make: it is sufficient, but we do not claim that it is also necessary. In the 
present paper we do not intend to analyse all the ways we believe this new approach can 
be used. We are planning to apply this technique to answer many other questions related 
to the randomly growing tree model. 

The paper is organized as follows. We introduce the terminology and notation in the 
next section, then in Section El we define the model. We state our results in Section 0 and 
show the simplifications that arise in the linear case. After that, in SectionElwe give a brief 
introduction to the theory of general branching processes, and quote the theorems that we 
rely on. We present our proofs in Section El We comment on the asymptotic growth of the 
tree in the last section. 


2 Rooted ordered trees: terminology and notation 

Throughout the paper it will be convenient to use genealogical phrasing. We will consider 
our tree evolving as a genealogical tree: where the individuals are the vertices and the 
parent child relations are the edges of the tree. It will be also convenient to follow the ‘birth 
orders’ among children of the same parent. For this purpose we will consider our trees 
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always as rooted ordered trees (sometimes called family trees or rooted planar trees). In the 
following paragraphs we introduce the (commonly known) terminology for these trees and 
also some additional notations needed for our theorems. 

2.1 Vertices, individuals, trees 

We will label the vertices of a rooted ordered tree using a subset of 

OO 

where Z+:= {1, 2,... }, Z° := {0}. 

n=0 

0 denotes the root of the tree, its children are labelled with {1, 2,... }, and in general the 
children of x = (xi,X 2 , ■ ■ ■, Xk) € M are labelled by (xi, X 2 , • • •, Xk-, 1), (xi, X 2 , • • •, Xk, 2),.... 
Thus if a vertex has the label x = (xi, X 2 ,..., x^) e AA then this means that it is the x^*^ child 
of its parent, which is the x^*L^ child of its own parent and so on. If x = (xi,X 2 , • • • ,Xfc) 
and y = {yi,y 2 , ■ ■ ■ ,yi) we will use the shorthanded notation xy for the concatenation 
(xi, X 2 ,..., Xk, yi,y 2 , ■ ■ ■, yi), and with a slight abuse of notation for n G Z_|_ we use xn for 
(xi,X2,... ,Xfc,n). 

We will identify a rooted ordered tree with the set of labels of its vertices, since this 
already contains the necessary information about the edges. It is clear that a G C M may 
represent a rooted ordered tree if and only if 0 G G and for each (xi,X 2 ,... ,Xk) G G we 
have (xi, X 2 , • • •, x^-i) G G as well as (xi, X 2 ,..., x^ — 1) G G, if x^ > 1. 

The set of hnite rooted ordered trees will be denoted by Q. We think about G £ Q as 
an oriented tree with edges pointing from parents to children. The degree of a vertex x G G 
is just the number of its children in G: 

deg(x, G) := max {n G Z_|_ : xn G G}. 

The generation of G G is 

G[n] := {x G G ; |x| = n}, re > 0, 

where |x| = re iff x G Z” . 

The re**^ ancestor of x = (xi, X 2 , • • •, Xk) G Af with fe > re is x"' = (xi, X 2 , • • •, Xk-n) if 
k > n and x"" = 0 if A: = re. 

The subtree rooted at a vertex x G G is: 

Gix ■= {y.xy£ G}, 

this is just the progeny of x viewed as a rooted ordered tree. Also, (again with a slight 
abuse of notations) for an x = (xi,X 2 ,... ,Xn) G Af with |x| = re > A: we use the notation 
Xjfc = {xn-k+i,Xn-k+ 2 , ■ ■ ■, Xn)- This would be the new label given to x G G in the subtree 
Gy^k . 
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Consider a. G & Q. An ordering s = (soiSi,... ,S|c|_i) of the elements of G is called 
historical if it gives a possible ’birth order’ of the vertices in G, formally if for each 0 < 
* < |G| — 1 we have {sQ) si,..., s^} G Q. The set of all historical orderings of G G ^ will be 
denoted S{G). For a fixed s G 5(G) the rooted ordered trees 

G(s,i) := {so,si,...,Si} C G (1) 

give the evolution of G in this historical ordering s. 

2.2 Random trees 

Throughout the paper we will use Greek letters to denote random elements (of various 
distributions) selected from M and Q: 

r,T,--eg 

Our results will deal with some asymptotic properties of a randomly chosen vertex in 
a certain random tree. We will investigate the asymptotic distribution of its degree, its 
progeny and also the progeny of its the ancestor. In order to study the latter object, 
we introduce rooted ordered trees with a marked vertex in generation k: 

:= {(G,n) G g X : u G G[fc]}. 

g{0) 

is identified with g, since generation 0 consists of only the root, 0. We can use the 
elements of to describe the progeny of the k^^ ancestor of a random vertex: G is an 
ordered tree rooted in the k^^ ancestor of the selected point and u G G[fc] is the position of 
the random vertex in this tree. Clearly, if (G, u) describes the progeny of the ancestor, 
then for 0 < / < A: the progeny of the ancestor is described by {G^^i,uik-i)- 

Thus if is a distribution on g^^'t which describes the progeny of the kk'^ ancestor of 
a chosen vertex, then, if / < A:, the distribution of the progeny of the ancestor is: 

:= ttW ({(G,u) G g^^^ : G^,. =G,v = 

The sequence of probability measures on A: = 0,1,2,... is called consistent if for 
any 0 < I < k, the identity holds. 

Without presenting the precise formulation, it is clear that a consistent sequence of 
probability measures on g^^'i gives full insight to the limiting structure of the tree as seen 
from a random vertex, see Remark after Theorem |2 
We call a probability measure vr on ^ steady if 

^7r(R) ^ l{H^^ = G} = 7r{G). (2) 

H£Q 
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It is easy to check that in this case, for any A; = 1,2,, the similar identity 

J2AH) HH^^ = G} = 7riG) 

H&g 

follows. Eqnivalently, for any bonnded fnnction ^ > M and any /c = 0,1, 2,... , 

E(|r[,]|99(rig)) =E(^(r)), 

where F is a random element of Q with distribution P(r = G) = vr(G), and on the left hand 
side ^ is a random vertex selected uniformly from the A:-th generation of F. (We don’t have 
to worry about the fact that Fmay be empty, since in that case the expression |F[fc] | <y7(Fjg) 
is automatically 0.) Immediate consequences of this property are that the expected size of 
the generation is 1 for any A: € N (choose tp identically 1), and therefore the expected 
size of the whole tree is infinite. 

Backward extensions: Given a steady probability measure vr on Q, define the probability 
measures on = 0,1,2..., by 

7r(*’)(G,u) :=7r(G) 

One can easily check that, due to the steadiness of the distribution vr, the sequence of 
probability measures on A: = 0,1, 2 ... , is consistent. 


3 The Random Tree Model 


3.1 Discrete Time Model 


Given a weight function rc : N —> M+, let us define the following discrete time Markov chain 
Trf on the countable state space Q, with initial state Trf(O) = {0}. If for n > 0 we have 
T(i(n) = G, then for a vertex x € G let A: := deg(x,G) + 1. Using this notation, let the 
transition probabilities be 


P(Trf(n + l) = GU{xA:}) 


-»;(deg(a;,G)) 

Ej/GG^(deg(y,G))' 


In other words, at each time step a new vertex appears, and attaches to exactly one 
already existing vertex. If the tree at the appropriate time is G, then the probability of 
choosing vertex x in the tree G is proportional to w(deg(x, G)). 


3.2 Continuous Time Model 

Given a weight function re : N —> M+, let X(t) be a Markovian pure birth process with 
X(0) =0 and birth rates 

P (X(t + dt) = A: + 11 X(t) = A:) = w(k) dt + o( dt). 
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Let p : [0, oo) i—> (0, oo] be the density of the point process corresponding to the pure birth 
process X{t) and p : (0, oo) —> (0, oo] the (formal) Laplace transform of p: 


OO n—1 


n QQ I L ± 

pW ■= e“^V(i) dt = X] n 


w{i) 


A + wii )' 

n=0 i=0 ' / 


(3) 


The rightmost expression of p{\) is easily computed, given the fact that the intervals be¬ 
tween successive jumps of X{t) are independent exponentially distributed random variables 
of parameters rc(0), r(;(l), rc(2),... respectively. Let 


A := inf{A > 0 : p{\) < oo}. 


(4) 


Throughout this paper we impose the following condition on the weight function w. 


lim p(A) >1. (M) 


We are now ready to define our randomly growing tree T (t) which will be a continuous 
time, time-homogeneous Markov chain on the countable state space Q, with initial state 
T(0) = {0}. 

The jump rates are the following: if for a t > 0 we have T(t) = G then the process 
may jump to G U {xk} with rate 'u;(deg(x, G)) where x G G and k = deg(x, G) -|- 1. This 
means that each existing vertex x G T(t) ‘gives birth to a child’ with rate 'u;(deg(x, T(t))) 
independently of the others. 

Note that condition (ImI) implies 


OO 


E 


1 

w{k) 


= oo 


and hence it follows that the Markov chain T(t) is well defined for t G [0,oo), it does not 
blow up in finite time. A rigorous proof of this statement follows from the connection with 
general branching processes (see Section |2I) for which the related statement is derived in j^. 
We define the total weight of a tree G G as 


W{G) :=^u;(deg(x,G)). 

xeG 

Described in other words, the Markov chain T(t) evolves as follows: assuming T(t—) = G, 
at time t a new vertex is added to it with total rate W (G) which is attached with an oriented 
edge (pointing towards the newly added vertex) to the already existing vertex x G G with 
probability 

r(;(deg(x, G)) 

EyGG^(deg(y,G))' 

Therefore, if we only look at our process at the stopping times when a new vertex is just 
added to the randomly growing tree: 


Tn := inf{t : |T(t)| = n -M} 


(5) 
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then we get the discrete time model: T(T„) has the same distribution as T(i{n), the discrete 
time model at time n. 

It will sometimes be convenient to refer to the vertices in the order of their birth, not 
their genealogical code: let {%} = TiTk) \ T(Tfc—) denote the vertex that appeared at T^. 
Of course we will always have 7?o = 0 and r/i = 1. 

4 Results 

4.1 Statement of results 

From condition (|M|) it follows that the equation 

p{\) = 1 


has a unique root A*. 

Now we are ready to state our first theorem. 

Theorem 1. Consider a weight funetion w satisfying eondition (ImI) and let A* he defined 
as above. Consider a bounded function ^ > M. Then the following limit holds almost 

surely: 

E = e-^-‘E{v(T(()))d(. 

x£T{t) 

From Theorem Q several statements follow, regarding the asymptotic behavior of our 
random tree as seen from a randomly selected vertex chosen uniformly from T(t). As 
typical examples we determine the asymptotic distribution of the number of children, re¬ 
spectively, that of the whole subtree under the randomly chosen vertex, its ancestor, 
respectively. That is: the asymptotic distribution of deg(C,T(t)) G N, G G and 

In order to formulate these consequences of Theorem^ we need to introduce some more 
notation. Let G € G and one of its historical orderings s = (sq, si,..., SjG|-i) S <S{G) be 
fixed. The historical sequence of total weights are defined as 

W{G,s,i) :=W{G{s,i)) (6) 

for 0 < i < |G| — 1 while the respective weights of the appearing vertices are defined as 

w{G, s, i) := w (deg ((sj)\ G(s, i - 1))) . (7) 

for 1 < z < |G| — 1. Since deg ((si)^,G(s,z — 1)) is the degree of Sj’s parent just before Si 
appeared, w{G, s, i) is the rate with which our random tree process jumps from G{s, i — 1) 
to G{s, i). 
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Given the weight function w : N —> M+ satisfying condition (IMl) and A* defined as before 
define 

Pw{k) := 

vr^(G) := 

Theorem 2. Consider a weight funetion w which satisfies condition (M) and let \* he 
defined as before. Then the following limits hold almost surely: 


TT w{i) 

A* + w{k) A* + w(i) ’ 

E X -p-r rc(G,s,z + l) 

A* + W(G) J-J- X* + W(G,s,i)' 

s£S{G) ^ ^ i=0 V ) W 


(a) For any fixed A: G N 


|{a;gT(t):deg(x,T(t)) = fc}| _ 

|T(t)| -Pw[k). 


(h) For any fixed G € Q 


lim 

£—>CXD 


{x G T{t) : T{t)^, = G} 


|T(t)| 


= -n-wiG). 


(c) For any fixed {G,u) G 

{x G T(t) : (T(t)^^(fe),X|fc) = (G,u)} 


lim 

t^OC 


|T(t)| 


= MG). 


Furthermore, the functions pw,T^w are probability distributions on N andQ, respectively, 
and 'Kw is steady (i.e. identity 0 holds). 


Remarks. 

1. Parts (a), (b) and (c) of Theorem [2 in turn, give more and more information about 
the asymptotic shape of the randomly growing tree T(t), as seen from a random vertex ( 
chosen with uniform distribution. Part (a) identifies the a.s. limit as t —> oo, of the degree 
distribution of C- Part (b) identifies the a.s. limit as t ^ oo, of the distribution of the 
progeny of C- Finally, part (c) does the same for the distribution of the progeny of the k^^ 
ancestor of the randomly selected vertex with the position of this vertex marked. 

2. Prom part (c) it is easy to derive the asymptotic distribution of the progeny of the 
ancestor of the randomly selected vertex (as a rooted ordered tree without any marked 

vertices): 


lim 

t^OO 


{x G T{t) : = G} 

\m\ 


— t^w{G) |G[fc] 


The limit is the size-biased version of 7r^(G), with the biasing done by the size of the kfi^ 
generation. 
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3. Since the distribution vr^ is steady, part (c) identifies the asymptotic distribution of the 
whole family tree of the randomly selected vertex C, (relatives of arbitrary degree included). 
Hence asymptotically, as t —> oo, the tree T(t) viewed from a random vertex C, will have 
the following structure (we omit the precise formulation): 

- there exists an infinite path of ancestors ... ‘going back in time’, 

- we have finite ordered random trees rooted at each vertex of this path, 

- the tree rooted at C,^ with the position of C, marked on it has distribution ■Kw'^ on g{k) 
where 'Kw\g,u) = 


4.2 Linear weight function 


In the linear case w{k) = ak + (3 {a, f3 > 0) all computations are rather explicit. In this case 
the asymptotic degree distribution pw (computed in a, nn) is reproduced, of course. But 
even in this simplest case the asymptotic distribution of the subtree under a randomly 
selected vertex seems to be new. 

For sake of completeness, in the rest of this section we perform these (explicit and 
straightforward) computations for the linear case. Multiplying the rate function with a 
positive constant only means the rescaling of time in our model thus it is enough to consider 
w{k) = k + P (with P > 0). In this case it is straightforward to compute that condition (M) 
holds, p(A) = A = 1 and X* = 1 + P- Thus both Theorems ^ and El hold. 

For the asymptotic degree distribution we get 


Pwik) 


(1 + P) 


{k — 1 + P)k 

(/c + 1 + 2/3)fc_|_i ’ 


where we used the shorthanded notation 


{x)k := = r(x-ll\) ’ fc = 0>l>2,.... 

For the calculation of ^^{G) first we show that the sum which defines it contains identical 
elements. In order to avoid heavy notation, during the following computations we will use 
n := |G| — 1 and deg(x) instead of deg(x, G). 

Clearly, for any s G S{G) 

n-l /deg(x)-l \ 

]Ju;(G,s,z + I) = «^(j) = n(‘^®s(a^) - 1 + /3)deg(a:)- 

i=0 x£G y j=0 J xGG 


(Actually, the first equality holds for every weight function w.) It is also easy to see that 
for any G € G 

W{G) = ^(deg(x) +P) = |G| (l + P)- 1, 

xeG 
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thus for any s € S(G) 


n—1 

n 


A* + W(a) A* + W{a, s.i) (1 + + 2 - (1 + /3)-i)„+i 


Therefore 


n^G) = |5(G)| 


f T /^)deg(3;) 


(1 + (3)'^{n + 2 — (1 + [5)~^)n+i 
In the (3 = 1 case (i.e. if we consider random tree proposed in |^) the previous calcula¬ 
tions give 

A 

Pwik) = 


and 


MG) = 


{k l)ik -\- ‘^(jik -|- 3) 
2|5(G)| 


(2|G| + 1)!! 


n deg(3:)!. 


x&G 


The value of |5(G)| cannot be written as the function of degrees of G only, but one can 
compute it using the values |G| 3 ;| for x G G. For a given G and x = (xi, X2, • • •, Xn) G G let 
us introduce the following notations (these will not be used in the other parts of the paper): 

B{x) := {?/ G G : y = (xi, X2,..., x„_i, fc), k > x„}, 
a(x) := max(|G|a:| - 1,1), h{x) := max (Ej/gS(x) > l) • 

It is a simple exercise to prove that for a G G ^ with |G| > 1 

|cS(G)| = (|G|-2)! H a{x)M{x)-\ 

The proof is left to the reader. 

5 Branching Processes 

The random tree model, defined in continuous time, has the big advantage that it fits 
into the framework of the well-established theory of general branching processes. We give a 
brief introduction to the fundamentals and state the theorems that we rely on in our proofs. 
We do not give a broad survey on the most general types of branching processes here, we 
choose to focus on the results which may be applied to our process. For more details see 
the monograph or the papers 0, m, M and the references therein. For a survey on 
branching processes, trees and superprocesses, see ra¬ 
in the case of a general branching process, there is a population in which each individual 
reproduces at ages according to i.i.d. copies of a random point process ^ on [0,oo). We 
denote by ^{t) the ^-measure of [0, t], this the random number of children an individual has 
up to time t. 
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The individuals in the population are labelled with the elements of M, as described in 
Section |31 (see (EU)). The basic probability space is 

{Q,A,P)= W^i^x^Ax^Px)-, 

x&Af 

where {fix, Ax, Px) are identical spaces on which ^x are distributed like 
For each x € AA there is a -ix shift defined on fl by 

(^lx)y ~ ^xy, 

in plain words, is the life of the progeny of x, regarding x as the ancestor. 

The birth times ax of the individuals are defined in the obvious way: (Tq = 0 and if 
x' = xn with n G Z_|_ then 

ax’= ax + vA{t : ix{t) >n}. (10) 

The branching process is often counted by a random characteristic, this can be any 
real-valued process {<!> : M x —> M}. For each individual x, ^x is defined by 

^x{t,uj) = 

in plain words <ha;(i) denotes the value of evaluated on the progeny of x, regarding x as 
the ancestor, at the time when x is of age t. We can think about ^xit) as a ‘score’ given to 
X when its age is t. With this, 

Z?=Y^^x{t-ax) 

is the branching process counted by the random characteristic (the ‘total score’ of the 
population at time t). 

For our applications we only consider random characteristics which are 0 for t < 0 and 
equal to a bounded deterministic function of the rooted tree for t >0. 

This means that only those individuals contribute to which are born up to time t and 
their contribution is a deterministic function of their progeny tree. (Random characteristics 
may be defined in a more general way, see e.g. 101 , 0 ) One of the important examples is 
d>(t) = Il{t > 0} when is just the total number of individuals born up to time t. 

The Laplace-transform of df{t) will be of great importance, we denote this random 
variable by: 

poo 

e(A):=/ e-^‘dC(t). (11) 

Jo 

We shall be interested in supercritical, Malthusian processes, meaning that there exists a 
finite 0 < A* < oo (the so-called Malthusian parameter) for which 

Ee(A*) = l, (12) 
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and also 

poo 

= E / < oo. (13) 

Jo 

(The last property means that the process is Malthusian and the first means that it is 
supercritical.) Also, we require the reproduction to be non-lattice, which means that the 
jumps of ^(t) cannot be supported by any lattice {0, d, 2d,... }, d > 0 with probability one. 

We quote here a weaker form of Theorem 6.3 from using its extension which appears 
in Section 7 of the same paper. This way the conditions of the original theorem are fulfilled 
automatically. 

Theorem A (Nerman, |12|1. Consider a supercritical, Malthusian branching process with 
Malthusian parameter A*, counted by two random characteristics $(t) and 'l'(t) which have 
the properties described above (i.e. they are 0 fort < 0 and a deterministic bounded function 
of the progeny tree for t > 0). Suppose that there exists a X< X* for which 

Ef'(A) < oo. 


Ac = -aA(E^(A)) 


Then almost surely 


Zf ^ $(A*) 
Zf ^ T(A*) 


as t 


oo. 


(14) 


^ OO 

where 4>(A) = f exp(—At)E(<h(t)) dt. 

0 

For a reader interested in how the Malthusian parameter and 4>(A) play a role in the 
theory, we give a short indication. This part may be skipped without any confusion. 

The key observation is the so-called basic decomposition, namely that 


ieN 

The advantage of this formula is that if we know the sequence {aj)j^z+ (all tbe birth-times 
of the children of the root), then zff^. has the same conditional distribution as Zf_^,. 

Therefore, using the notation mf = ThZf, taking expectation on both sides of m in 
two steps (hrst conditionally on {crj)j^x+, then taking expectation regarding {(7j)j^z+), we 
get 


mf = E{^{t))+[ mf_,dfi{s), 
JO 


(16) 


where we used the notation /i(t) = E^(t). Taking the Laplace transform of both sides. 


m(A) = <1)(A) -|- m{X)fi{X), 


(17) 
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so formally 



(18) 


Note that /2(A) = E^(A). This shows that if there is a positive interval below A* where the 
Laplace transform is finite, then 1/(1 — /2(A)) has a simple pole at A* (it is easy to check 
that /2^(A*) < 0 and /2"(A*) > 0). So taking series expansion and inverse Laplace transform 
results that 



(19) 


where n is the finite constant defined in m- This means that the ratio of the expectations 


of and Zf indeed tends to . To get the almost sure convergence of ^ to the same 
limit needs a lot more work and of course its proof is more elaborate (see m for example). 


6 Proofs 


Proof of Theorem^\ Consider the continuous time branching process where the reproduc¬ 


tion process ^(t) is the Markovian pure birth process X{t), with rate function w, described 
at the beginning of Section |21 

Clearly, the time-evolution of the population has the same distribution as the evolution 
of the continuous time Random Tree Model corresponding to the weight function w. The 
vertices are the respective individuals and edges are the parent-child relations. 

It is also not hard to see that the function E,^(A) for the branching process is the same 
as /2(A) which means that by condition (M) we may apply Theorem A with appropriate 
random characteristics. Given any bounded function 99 : ^ > M, setting the characteristics 

$,' 1 ' as $(t) := (/ 9 (T(/))l{t > 0 } and 'k(t) := l{t > 0 } we get exactly the statement of 
Theorem □ □ 

Proof of Theorem\^ (a) Apply Theorem ^ with the function 


ip{G) :=l{deg(0,G) = fc}. 


This gives that 



almost surely. By the definition of ak (see uni): 


P (deg(0, T{t)) = k) = P (ufc < t) - P (ufc+i < t). 


Since 
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and (Tfc is the sum of independent exponentially distributed random variables with param¬ 
eters tt)(0), 'u;(l),... , w{k — 1), we get 


pc 

X* e-^‘*P(deg(0,T(t)) = A:) dt = 
Jo 


k-l 

n 


w{i) 


\* + w{k) A* -|- w{i) 


This completes the proof of part (a) of the Theorem. 

Using the identity 

/•OO 

p^{k) = X* / e“^**P(deg(0,T(t)) = A:) dt, 

Jo 

and the fact that |T(t)| is finite for every t with probability 1 it is straightforward to prove 
that pw is indeed a probability distribution on N. 

(b) Let G € ^ be hxed and denote n = |G| —1. We apply TheoremC]with ^{H) = = G). 

We need to compute 

poo 

X* e-^*‘P(T(t) =G)dt. 

Jo 

Consider the following random stopping times: 


tg ■= sup{t > 0 : T(i) C G}, 

Tq ■= sup{t > 0 : T(i) C G}. 

That is: tg is the birth time of the first vertex not in G, while Tq is the minimum of tg 
and the time when we first have T(t) = G, if the latter ever happens. Since 

P(T(t) = G) =P(T(t)CG)-P(T(t)CG) =P(A<TG)-P(t<T^), 


we get that 


A* 



g-Anp(T(i) = Q'^ 




Note that by the definition we always have Tq < tg- The event {t'q < tg} means that 
there is a A when T(t) = G. On this event Tq gives the time when we first have T(A) = G 
and Tg gives the appearance of the next vertex. Given the event {tq < tg}, the condi¬ 
tional distribution of tg — t'q is exponential with parameter W (G) and it is (conditionally) 
independent of t'q. This leads to 


E((e-"%-e-"*"«)Il{T^<rG}) 


A* 

A* + W{G) 


E(e-"*"ol{r^<rG}). 


Now, it is clear that the following two events are actually the same 


[t'q < tg} = {(r?o, ...,r]n) = (so, • • • ,s„) for some s € 5(G)}. 
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This implies that 


E(e-"*"GlL{r^<rG}) 

= E(e"^*^"l{(?7o,...,r7n) = (so,---,'Sn)})- 

sGS(G) 

For Tn see m and the definition below it. Given s £ S{G) fixed 

(See JSia), ® and 0 for the definitions.) Also, if s G <S{G) is fixed then conditionally on the 
event {{r]o ,..., rjn) = (soj • • •, Sn)} the random variables Tj^+i — T^, k = 0,1,... ,n — 1, are 
independent and exponentially distributed with parameters W{G, s,k), k = 0,1,... ,n — 1, 
respectively. This is an easy exercise: it may be proved by using the ‘lack of memory’ of 
the exponential distribution and the fact that the minimum of independent exponentially 
distributed random variables with parameters z/i, 1 / 2 ,..., is also exponentially distributed 
with parameter Hence it is straightforward to get 


E(e ^‘^"Il{(r?o,...,r?n) 


(so,--- 


n—1 


«»)))=n 

j=0 


w{G, s, i + 1) 
A* + iy(G,s,i)' 


Collecting our previous calculations part (b) of Theorem [21 follows. 

Using similar considerations as in the end of the proof of part (a) it is apparent that tt^ 
is a probability distribution on Q. 

(c) This is straightforward since for any H £ Q and {G, u) £ we have 


{x £ H : {H^^ik),xik) = (C,?/)} 


\{x £ H : = G}\. 


The statement now follows from part (b). 

The only thing left to prove is that tTw satisfies 0, i.e. it is steady. First observe, that 
if Go G ^ is fixed and ^ is a uniformly chosen random vertex in Gq then the distribution 
of F := (Go)|^ (which is a probability distribution on Q) is steady. (This follows by simple 
counting.) Equation 0 is linear in vr, therefore mixtures of steady distributions are also 
steady. Thus, if is a uniformly chosen random vertex in T(t) then the distribution of 
T(t)|^ (which is a random probability distribution on Q) is also steady. By part (b) with 
probability one these distributions converge (in distribution) to and from this an easy 
consideration shows that must satisfy 0. □ 


7 Asymptotic growth 

In our theorems we determined the asymptotic ratio of vertices in T(t) satisfying certain 
properties. It is also natural to ask if one can prove results about the asymptotic number of 
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the respective vertices. As we have seen, this essentially requires to study the asymptotic 
behavior of for a suitable random characteristic ‘h. This has been done in the framework 
of general branching processes, we shall give a short overview of the relevant results. 

We have already seen that 

E(e-^**Zf) ^ -Hxn, (20) 

where k is the constant defined in m- Thus we need to divide by * to get something 
non-trivial. Let us quote a weaker form of Theorem 5.4 of Ha- 

Theorem B (Nerman, |12|L Consider a supercritical, Malthusian branching process with 
Malthusian parameter A*. Suppose that condition (M) holds and ^ is a random character¬ 
istic with properties described in Section^ Then almost surely 

^ -4>(A*)0, ast^oo, 

K 

where Q is a random variable not depending on <h. 

The necessary and sufficient condition for the random variable 0 to be a.s 
the so-called xlogx property of the reproduction process 

E(f'(A*)log+^(A*))<cx). 

We quote Theorem 5.3 of [Zj. 

Theorem C (Jagers-Nerman, |7|). Consider a supercritical, Malthusian branching pro¬ 
cess with Malthusian parameter A*. If condition HP holds then 0 > 0 a.s. and E(0) = 1; 
otherwise 0 = 0 a.s. 

Remark. This theorem is the generalization of the Kesten-Stigum theorem, which states 
this fact for Galton-Watson processes (see 0). 

Theorem B applies for the random tree model if condition (M) is fulfilled. We do not 
intend to identify the necessary and sufficient condition on the weight function w which 
would guarantee that the corresponding reproduction process possesses property (L). Still 
it is worth pointing out that if w{k) ^ oo as t > oo, then this property holds, this by 
Theorem C, 0 is a.s. positive. 

Lemma 1. If a weight function w satisfies condition (M) and w{n) —> oo, as n ^ oo, then 
the corresponding branching process satisfies condition (L). 

Proof. We will prove the existence of the second moment of ^(A*) from which condition (L) 
trivially follows. Since ^(A*) = we need 

CO \ 2 

^g-AVfej ^ ^21) 

k=l / 



positive is 


(L) 
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The random variables (Jk+i — ffc are independent exponentials for k = 0,1, 2,... with pa¬ 
rameters r(;(0), r(;(l),... , respectively, thns a simple computation yields that the expression 
in m is equal to 


CO i 


-p(2A*) + 2^^ 
i=0 j=0 


(TT ^(0 TT ^(0 
1 J-i 2A* -I-tcf/) X* + w(l) 

\i=o ^ ’ i=j+i ^ ' 


Transforming the double sum on the right, we get 


\ ^ /TT w{l) \ \ - TT < \ " TT 

I 11 A* + 'u;(^) / ^ ^ ^ 2A* -I- w{l) ~ ^ ^ ^ 2X* + w{l) 

i=0 \l=o > J j=Q 1=0 ' \ J * /=o ' w 

where we also used p(A*) = 1. On the other hand, 


^ A* -I- w{l) _ w{l) (A -I- w{l)){X* -I- w{l)) 

^^\A2X* + w{l) + w{l){2X*+w{l)) 

Since w{l) —> oo and A < A*, we have 

(A -I- w{l)){X* + w{l)) 
w{l){2X* + w{l)) 

if I is large enough. This leads to 


TT ^ K ^(0 

^^^2X*+w{l) ^^^X + w{l) 

j=0 1=0 ' y j j=o 1=0— ^ ^ 


< Kp{X) < oo. 


by condition (M) which completes the proof of the lemma. 


□ 


The distribution of 0 is usually hard to determine from the weight function w, however, 
one can characterize its moment generating function (p{u) = Ee““®. Using the idea of the 
basic decomposition (ESI) one can write the following equation for f{u,t) := Ee 

f{u,t) = e~^EY[fiu,t-Tj). 
i>i 

By Theorem 4 


(p{u) = lim f{ue ^ *,t) (22) 

t—^OO 

which gives 

p{u) = E p{ue~^*^^). (23) 

i>i 

It can be proved that this equation characterizes (p as there is no other bounded function 
satisfying it with a right derivative —1 at 0. (See Theorem 6.8.3 in (Sj.) 
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In the linear case {w{k) = ak + (5) the distribution of 0 may be explicitly calculated. 
Using the fact that |T(t)| is now a Markov process (which is an easy exercise to prove) 
and applying standard martingale techniques one can derive a partial differential equation 
for Solving this pde and using the limit (1^^ we get the momentum generating 

function of u) from which one can identify the distribution of 0 as a Gamma distribution 
with parameters (^, ^). 
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